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ABSTRACT 

Given a task T, a set of experts V with multiple skills and a 
social network G(V,W) reflecting the compatibility among 
the experts, team formation is the problem of identifying 
a team C C V that is both competent in performing the 
task T and compatible in working together. Existing meth¬ 
ods for this problem make too restrictive assumptions and 
thus cannot model practical scenarios. The goal of this pa¬ 
per is to consider the team formation problem in a realistic 
setting and present a novel formulation based on densest 
subgraphs. Our formulation allows modeling of many natu¬ 
ral requirements such as (i) inclusion of a designated team 
leader and/or a group of given experts, (ii) restriction of the 
size or more generally cost of the team (iii) enforcing locality 
of the team, e.g., in a geographical sense or social sense, etc. 
The proposed formulation leads to a generalized version of 
the classical densest subgraph problem with cardinality con¬ 
straints (DSP), which is an NP hard problem and has many 
applications in social network analysis. In this paper, we 
present a new method for (approximately) solving the gen¬ 
eralized DSP (GDSP). Our method, FORTE, is based on 
solving an equivalent continuous relaxation of GDSP. The 
solution found by our method has a quality guarantee and 
always satisfies the constraints of GDSP. Experiments show 
that the proposed formulation (GDSP) is useful in modeling 
a broader range of team formation problems and that our 
method produces more coherent and compact teams of high 
quality. We also show, with the help of an LP relaxation of 
GDSP, that our method gives close to optimal solutions to 
GDSP. 

Categories and Subject Descriptors 

H.2.8 [Database Management]: Database Applications— 
Data mining ; G.2.2 [Discrete Mathematics]: Graph The¬ 
ory —Graph algorithms 

General Terms 

Algorithms, Experimentation, Theory 

Keywords 

Team formation, Social networks, Densest subgraphs 


1. INTRODUCTION 

Given a set of skill requirements (called task T), a set of 
experts who have expertise in one or more skill, along with a 
social or professional network of the experts, the team forma¬ 
tion problem is to identify a competent and highly collabo¬ 
rative team. This problem in the context of a social network 
was first introduced by [18] and has attracted recent interest 
in the data mining community [15| |5j [l2]. A closely related 
and well-studied problem in operations research is the as¬ 
signment problem. Here, given a set of agents and a set of 
tasks, the goal is to find an agent-task assignment minimiz¬ 
ing the cost of the assignment such that exactly one agent is 
assigned to a task and every task is assigned to some agent. 
This problem can be modeled as a maximum weight match¬ 
ing problem in a weighted bipartite graph. In contrast to the 
assignment problem, the team formation problem considers 
the underlying social network, which for example models 
the previous collaborations among the experts, while form¬ 
ing teams. The advantage of using such a social network is 
that the teams that have worked together previously are ex¬ 
pected to have less communication overhead and work more 
effectively as a team. 

The criteria explored in the literature so far for measur¬ 
ing the effectiveness of teams are based on the shortest path 
distances, density, and the cost of the minimum spanning 
tree of the subgraph induced by the team. Here the den¬ 
sity of a subgraph is defined as the ratio of the total weight 
of the edges within the subgraph over the size of the sub¬ 
graph. Teams that are well connected have high density val¬ 
ues. Methods based on minimizing diameter (largest short¬ 
est path between any two vertices) or cost of the spanning 
tree have the main advantage that the teams they yield are 
always connected (provided the underlying social network 
is connected). However, diameter or spanning tree based 
objectives are not robust to the changes (addition/deletion 
of edges) in the social network. As demonstrated in 12 
using various performance measures, the density based ob¬ 
jective performs better in identifying well connected teams. 
On the other hand, maximizing density may give a team 
whose subgraph is disconnected. This happens especially 
when there are small groups of people who are highly con¬ 
nected with each other but are sparsely connected to the 
rest of the graph. 

Existing methods make either strong assumptions on the 
problem that do not hold in practice or are not capable of in¬ 
corporating more intuitive constraints such as bounding the 
total size of the team. The goal of this paper is to consider 


the team formation problem in a more realistic setting and 
present a novel formulation based on a generalization of the 
densest subgraph problem. Our formulation allows model¬ 
ing of many realistic requirements such as (i) inclusion of a 
designated team leader and/or a group of given experts, (ii) 
restriction on the size or more generally cost of the team (iii) 
enforcing locality of the team, e.g., in a geographical sense 
or social sense, etc. In fact most of the future directions 
pointed out by [l2] are covered in our formulation. 

2. RELATED WORK 

The first work 18l in the team formation problem in the 
presence of a social network presents greedy algorithms for 
minimizing the diameter and the cost of the minimum span¬ 
ning tree (MST) induced by the team. While the greedy 
algorithm for minimizing the diameter has an approxima¬ 
tion guarantee of two, no guarantee is proven for the MST 
algorithm. However, fl8] impose the strong assumption that 
a skill requirement of a task can be fulfilled by a single per¬ 
son; thus a more natural requirement such as “at least k ex¬ 
perts of skill s are needed for the task” cannot be handled by 
their method. This shortcoming has been addressed in 12 , 
which presents a 2-approximation algorithm for a slightly 
more general problem that can accommodate the above re¬ 
quirement. However, both algorithms cannot handle an up¬ 
per bound constraint on the team size. On the other hand, 
the solutions obtained by all these algorithms (including the 
MST algorithm) can be shown to be connected subgraphs if 
the underlying social graph is connected. 

Two new formulations are proposed in [15] based on the 
shortest path distances between the nodes of the graph. The 
first formulation assumes that experts from each skill have to 
communicate with every expert from the other skill and thus 
minimizes the sum of the pairwise shortest path distances 
between experts belonging to different skills. They prove 
that this problem is NP-hard and provide a greedy algo¬ 
rithm with an approximation guarantee of two. The second 
formulation, solvable optimally in polynomial time, assumes 
that there is a designated team leader who has to communi¬ 
cate with every expert in the team and minimizes the sum 
of the distances only to the leader. The main shortcoming 
of this work is its restrictive assumption that exactly one 
expert is sufficient for each skill, which implies that the size 
of the found teams is always upper bounded by the number 
of skills in the given task, noting that an expert is allowed 
to have multiple skills. They exploit this assumption and 
(are the first to) produce top-fc teams that can perform the 
given task. However, although based on the shortest path 
distances, neither of the two formulations does guarantee 
that the solution obtained is connected. 

In contrast to the distance or diameter based cost func¬ 
tions, [l2] explore the usefulness of the density based ob¬ 
jective in finding strongly connected teams. Using various 
performance measures, the superiority of the density based 
objective function over the diameter objective is demon¬ 
strated. The setting considered in [l2] is the most general 
one until now but the resulting problem is shown to be NP 
hard. The greedy algorithms that they propose have ap¬ 
proximation guarantees (of factor 3) for two special cases. 
The teams found by their algorithms are often quite large 
and it is not straightforward to modify their algorithms to 
integrate an additional upper bound constraint on the team 
size. Another disadvantage is that subgraphs that maximize 


the density under the given constraints need not necessarily 
be connected. 

Recently [ 2 ] considered an online team formation problem 
where tasks arrive in a sequential manner and teams have 
to be formed minimizing the (maximum) load on any ex¬ 
pert across the tasks while bounding the coordination cost 
(a free parameter) within a team for any given task. Ap¬ 
proximation algorithms are provided for two variants of co¬ 
ordinate costs: diameter cost and Steiner cost (cost of the 
minimum Steiner tree where the team members are the ter¬ 
minal nodes). While this work focusses more on the load 
balancing aspect, it also makes the strong assumption that 
a skill is covered by the team if there exists at least one 
expert having that skill. 

All of the above methods allow only binary skill level, i.e., 
an expert has a skill level of either one or zero. 

We point out that many methods have been developed in 
the operations research community for the team formation 
problem, |5, 9, 21,[20], but none of them explicitly consid¬ 
ers the underlying social or professional connections among 
the experts. There is also literature discussing the social as¬ 
pects of the team formation Roland their influence on the 
evolution of communities, e.g., [4|. 

3. REALISTIC TEAM FORMATION IN SO¬ 
CIAL NETWORKS 

Now we formally define the Team Formation problem that 
we address in this paper. Let V be the set of n experts and 
G(V, IT) be the weighted, undirected graph reflecting the re¬ 
lationship or previous collaboration of the experts V. Then 
non-negative, symmetric weight Wij G IT connecting two 
experts i and j reflects the level of compatibility between 
them. The set of skills is given by A — {an,..., a v }. Each 
expert is assumed to possess one or more skills. The non¬ 
negative matrix M G R nXp specifies the skill levels of all 
experts in each skill. Note that we define the skill level on 
a continuous scale. If an expert i does not have skill j, then 
Mij = 0. Moreover, we use the notation Mj G R nxl for 
the j— th column of M, i.e. the vector of skill levels corre¬ 
sponding to skill j. A task T is given by the set of triples 
{(aj, Kj, tj)}^ =1 , where aj G A, specifying that at least Kj 
and at most tj of skill aj is required to finish the given task. 

Generalized team formation problem. Given a task 
T, the generalized team formation problem is defined as 
finding a team C C V of experts maximizing the collabora¬ 
tive compatibility and satisfying the following constraints: 

• Inclusion of a specified group: a predetermined 
group of experts S C V should be in C. 

• Skill requirement: at least Kj and at most tj of skill 
aj is required to finish the task T. 

• Bound on the team size: the size of the team should 
be smaller than or equal to b, i.e., \C\ <6. 

• Budget constraint: total budget for finishing the 

task is bounded by B, i.e., Ci — where d G 

R+ is the cost incurred on expert i. 

• Distance based constraint: the distance (measured 
according to some non-negative, symmetric function, 
dist) between any pair of experts in C should not be 
larger than do, i.e., dist(w, v) < do,\/u,v G C. 


Discussion of our generalized constraints. In con¬ 
trast to existing methods, we also allow an upper bound 
on each skill and on the total team size. If the skill ma¬ 
trix is only allowed to be binary as in previous work, this 
translates into upper and lower bounds on the number of 
experts required for each skill. Using vertex weights, we can 
in fact encode more generic constraints, e.g., having a limit 
on the total budget of the team. It is not straightforward to 
extend existing methods to include any upper bound con¬ 
straints. Up to our knowledge we are the first to integrate 
upper bound constraints, in particular on the size of the 
team, into the team formation problem. We think that the 
latter constraint is essential for realistic team formation. 

Our general setting also allows a group of experts around 
whom the team has to be formed. This constraint often 
applies as the team leader is usually fixed before forming the 
team. Another important generalization is the inclusion of 
distance constraints for any general distance functioiQ Such 
a constraint can be used to enforce locality of the team e.g. 
in a geographical sense (the distance could be travel time) 
or social sense (distance in the network). Another potential 
application are mutual incompatibilities of team members 
e.g. on a personal level, which can be addressed by assigning 
a high distance to experts who are mutually incompatible 
and thus should not be put together in the same team. 

We emphasize that all constraints considered in the liter¬ 
ature are special instances of the above constraint set. 


Measure of collaborative compatiblity. In this paper 
we use as a measure of collaborative compatibility a gener¬ 
alized form of the density of subgraphs, defined as 


density(C) := 


assoc (C) 
vol g(C) 


^2i,jCC Wi i 


(1) 


where Wij is the non-negative weight of the edge between i 
and j and volg(C) is defined as XascS*’ with 9i being the 
positive weight of the vertex i. We recover the original den¬ 
sity formulation, via gt = 1, Vi £ V. We use the relation, 
assoc(U) = vold(C') — cut(C, V\C), where di = X]"=i w ij is 
the degree of vertex i and cut(A, B ) := YhieA j SB Wij- 


Discussion of density based objective. As pointed 
out in 12 , the density based objective possesses useful prop¬ 
erties like strict monotonicity and robustness. In case of the 
density based objective, if an edge gets added (because of 
a new collaboration) or deleted (because of newly found in¬ 
compatibility) the density of the subgraphs involving this 
edge necessarily increases resp. decreases, which is not true 
for the diameter based objective. In contrast to density 
based objective, the impact of small changes in graph struc¬ 
ture is more severe in the case of diameter objective 12j. 

The generalized density that we use here leads to further 
modeling freedom as it enables to give weights to the ex¬ 
perts according to their expertise. By giving smaller weight 
to those with high expertise, one can obtain solutions that 
not only satisfy the given skill requirements but also give 
preference to the more competent team members (i.e. the 
ones having smaller weights). 


Problem Formulation. Using the notation introduced 
above, an instance of the team formation problem based on 
the generalized density can be formulated as 


assoc (C) 
max — 

CCV VOl g(C) 

subject to : S C C 

Kj < VO\ Mj (C) <bj, V? € {1,... ,p} 

\C\<b 

vol c (C) < B 

dist(w, u) < do, \/u,v £ C, 


( 2 ) 


Note that the upper bound constraints on the team size 
and the budget can be rewritten as skill constraints and can 
be incorporated into the skill matrix M accordingly. Thus, 
without loss of generality, we omit the budget and size con¬ 
straints from now on, for the sake of brevity. Moreover, 
since S is required to be part of the solution, we can assume 
that dist(u,v) < do,\/u,v £ S, otherwise the above problem 
is infeasible. The distance constraint also implies that any 
u £ V for which dist(u, s) > do, for some s £ S, cannot be a 
part of the solution. Thus, we again assume wlog that there 
is no such u £ V; otherwise such vertices can be eliminated 
without changing the solution of problem 


Our formulation is a generalized version of the classical 
densest subgraph problem (DSP), which has many applica¬ 
tions in graph analysis, e.g., see [l9]. The simplest version 
of DSP is the problem of finding a densest subgraph (with¬ 
out any constraints on the solution), which can be solved 
optimally in polynomial time 13!. The densest-fc-subgraph 
problem, which requires the solution to contain exactly k 
vertices, is a notoriously hard problem in this class and has 
been shown not to admit a polynomial time approximation 
scheme 16 . Recently, it has been shown that the densest 


subgraph problem with an upper bound on the size is as 
hard as the densest-fc-subgraph problem 17 . However, the 


densest subgraph problem with a lower bound constraint 
has a 2-approximation algorithm IT]. It is based on solv¬ 
ing a sequence of unconstrained densest subgraph problems. 
They also show that there exists a linear programming re¬ 
laxation for this problem achieving the same approximation 
guarantee. 

Recently [12] considered the following generalized version 
of the densest subgraph problem with lower bound con¬ 
straints in the context of team formation problem: 


assoc (C) 
max —wTtr 
ccv vol g(C) 

subject to : voImj (C) > Kj, Vj £ (1,... ,p} 


(3) 


where M is the binary skill matrix. They extend the greedy 
method of [l7j and show that it achieves a 3-approximation 
guarantee for some special cases of this problem. [8| re¬ 
cently improved the approximation guarantee of the greedy 
algorithm of 12 for problem to a factor 2. The time 
complexity of this greedy algorithm is 0(kn 3 ), where n is 
the number of experts and k := kj is the minimum 

number of experts required. 


1 The distance function need not satisfy the triangle inequal 
ity. 


Direct integration of subset constraint. The subset 
constraint can be integrated into the objective by directly 
working on the subgraph G' induced by the vertex set V' = 










V\S. Note that any C C V that contains S can be written 
as C = A U S, for A C V'. We now reformulate the team 
formation problem on the subgraph G'. We introduce the 
notation m = \V'\, and we assume wlog that the first m 
entries of V are the ones in V'. 

The terms in problem 0 can be rewritten as 

assoc(C) = assoc(A) + assoc(S') + 2 cut(A, S), 

= vol d (A) — cut(A, U\A) + assoc(S') + 2 cut(A, S) 
= vol d (A) — cut(A, V , \j4) + assoc(S) + cut(A, S) 

VOl g(C) = VOlg(A) + VOlg(S) 


Moreover, note that we can write: cut(A, S) = vol d s(A), 
where df = YLjes Wi i denotes the degree of vertex i re¬ 
stricted to the subset S in the original graph. Using the 
abbreviations, ps = assoc(S), us = vol g (S), assocs(A) = 
void (A) — cut(A, U'\A) +ps + vol d s (A), we rewrite the team 
formation problem © as 


assocs(A) 

max — , . —— 

Acv\ A^<6 vol 9 (A) + us 


(GDSP) 


subject to : kj < voW., (A) < lj, Vj € {1,... ,p} 
dist(u, v) < do, Vu, v € A, 


where for all j = 1 the bounds were updated as 

kj = Kj—vol Mj (S), lj = ij—vo\Mj (S). Note that here we al¬ 
ready used the assumption: dist(u, s) < do,\/u £ V, Vs £ S. 
The constraint, A 7 ^ 0, has been introduced for technical rea¬ 
sons required for the formulation of the continuous problem 
in Section 4.2 The equivalence of problem (GDSPI to © 
follows by considering either S (if feasible) or the set A* U S, 
where A* is an optimal solution of (GDSPI, depending on 
whichever has higher density. 

To the best of our knowledge there is no greedy algo¬ 
rithm with an approximation guarantee to solve problem 
|GDSPf. Instead of designing a greedy approximation al¬ 
gorithm for this discrete optimization problem, we derive 
an equivalent continuous optimization problem in Section[4] 
That is, we reformulate the discrete problem in continuous 
space while preserving the optimality of the solutions of the 
discrete problem. The rationale behind this approach is that 
the continuous formulation is more flexible and allows us to 
choose from a larger set of methods for its solution than for 
the discrete one. Although the resulting continuous problem 
is as hard as the original discrete problem, recent progress 
in continuous optimization 14| allow us to find a locally 
optimal solution very efficiently. 


4. DERIVATION OF FORTE 


In this section we present our method, Formation Of Re¬ 
alistic Teams (FORTE, for short) to solve the team for¬ 
mation problem, which is rewritten as {GDSP|, using the 
continuous relaxation. We derive FORTE in three steps: 


i. Derive an equivalent unconstrained discrete problem 
© of the team formation problem ( |GDSP | via an ex¬ 
act penalty approach. 


ii. Derive an equivalent continuous relaxation © of the 
unconstrained problem 0 by using the concept of Lo- 
vasz extensions. 


iii. Compute the solution of the continuous problem 0 
using the recent method RatioDCA from fractional 
programming. 


4.1 Equivalent Unconstrained Problem 

A general technique in constrained optimization is to trans¬ 
form the constrained problem into an equivalent unconstrained 
problem by adding to the objective a penalty term, which 
is controlled by a parameter 7 > 0. The penalty term is 
zero if the constraints are satisfied at the given input and 
strictly positive otherwise. The choice of the regularization 
parameter 7 influences the tradeoff between satisfying the 
constraints and having a low objective value. Large val¬ 
ues of 7 tend to enforce the satisfaction of constraints. In 
the following we show that for the team formation problem 
( GDSP[ ) there exists a value of 7 that guarantees the satis¬ 
faction of all constraints. 


Let us define the penalty term for constraints of the team 
formation problem (|GDSP[) as 


pen(A) := < 




+ E 


=1 max{0, vol_Mj (A) - lj} 

+ Ej=i max{ 0 , kj - vol Mj (A)} 

, max{ 0 , dist(u, v) — do} 


0 


u,v£A 1 


A^0 
A = 0. 


Note that the above penalty function is zero only when A 
satisfies the constraints; otherwise it is strictly positive and 
increases with increasing infeasibility. The special treat¬ 
ment of the empty set is again a technicality required later 
for the Lovasz extensions, see Section |4.2| For the same 
reason, we also replace the constant terms /as and us in 
(GDSP I by ps unit(A) and us unit(A) respectively, where 
unit (A) := 1, A 7 ^ 0 and unit(0) = 0. 

The following theorem shows that there exists an uncon¬ 
strained problem equivalent to the constrained optimization 
problem (GDSPI. 


Theorem 1. The constrained problem \GDSP\ ) is equiv¬ 
alent to the unconstrained problem 


mm 

U/ACV 


vol 9 (A) + us unit(A) + 7 pen(A) 
assocs (A) 


( 4 ) 


for 7 > vol A v ■* vl ^ a lat°(An) ’ where Ao is any feasible set 
of problem 1 GDSF\ such that assocs(Ao) > 0 and 6 is the 
minimum value of infeasibility, i.e., pen(A) > 9, if A is 
infeasible. 


Proof. We define spvol(A) := 


volg (A)-\-us unit(A) 


Note 


_ assocg^-A) 

that maximizing 1 GDSF\ ) is the same as minimizing spvol(A) 
subject to the constraints of \GDSP ). For any feasible sub¬ 
set A, the objective of 0 is equal to spvol(A), since the 
penalty term is zero. Thus, if we show that all minimizers of 
0 satisfy the constraints then the equivalence follows. Sup¬ 
pose, for the sake of contradiction, that A* ( 7 ^ 0, if S = 0} 
is a minimizer of 0 and that A* is infeasible for problem 
\GDSP\) . Since us > 0 and gi > 0,V«, we have under the 
given condition on 7 , 

vol 9 (A*) + us + 7 pen(A*) 7 pen(A*) 


> 


assocs(A*) 
7 9 


assocs(A* 


> 


7 9 vo1 9 (A 0 ) + u s 


> 


maxAcv assocs(A) vol d (U) assocs(Ao) 

which leads to a contradiction because the last term is the 
objective value of 0 at Ao. □ 









































4.2 Equivalent Continuous Problem 

We will now derive a tight continuous relaxation of prob¬ 
lem 0. This will lead us to a minimization problem over 
R m , which then can be handled more easily than the original 
discrete problem. The connection between the discrete and 
the continuous space is achieved via thresholding. Given a 
vector / € R m , one can define the sets 

Ai := {j € V\fj > ft} , (5) 

by thresholding / at the value fi. In order to go from func¬ 
tions on sets to functions on continuous space, we make use 
of the concept of Lovasz extensions. 

Definition 1. (Lovasz extension) Let R : 2' — » R be a 
set function with 7?(0) = 0, and let f G R m be ordered in 
ascending order /i < f 2 < • • • < f m . The Lovasz extension 
R l : R m — ¥ R of R is defined by 

m—1 

R L (f) = E (fi+1 - fi) + R ( y )f i- 

i=1 


good as the one of f. This holds because 


R L (f) = E R(Ai+i)(fi+i - fi) + fiR(v') 


i =1 
m—1 


R(. A i+i) aasocs(i4i + i)(/i + i - fi) 
“ assoc 5 (Ai+i) 
z=i v ' 

R{V') nr's* 

+-assocs(V )/i 

assocs(v') 


> min 


R(Aj) 


3=1,...m aSSOCs(A/) 


E assocs(A i+ i)(/ i+ i - fi) + assocs(I /, )/ij 
assoc s(f) 


R(Aj ) _ L / 


■ mm 
j = l....m assocs(^lj) 


The third step follows from the fact that f is non-negative 
(fi > 0 ) and ordered in ascending order, i.e., fi +1 — fi > 
0,Vi = 1 1. Since assocs(/) is non-negative, the 

final step implies that 


Note that R l (1a) = R(A) for all A C V, i.e. R L is indeed an 
extension of R from 2 V to R v (| V| = m). In the following, 
given a set function R , we will denote its Lovasz extension 
by R L . The explicit forms of the Lovasz extensions used in 
the derivation will be dealt with in Section \4 .31 

In the following theorem we show the equivalence for |GDSPl 
A more general result showing equivalence for fractional set 
programs can be found in [T]. 


Theorem 2. The unconstrained discrete problem 0 is 
equivalent to the continuous problem 


min volg (f) + vs unit 1 ' (A) + 7 pen L (/) 
/eK v' assoc §(/) 


( 6 ) 


for any 7 > 0. Moreover, optimal thresholding of a mini- 
mizer /* G R£, 

a* ._ • volgW + vs +7pen(Aj) 

■Ai • 111111 ^ 

A i= {jgV'|/?>/f },i=l,...,m aSSOCs(Ai) 


> min 


R(A j 


R L (f) . 

assoc s(f) ~ 3 = 1 ,■■■m assoc s(Aj) ’ 


(7) 


Thus we have 


R L U) ^ 

mm - r~rrz > mm 


R(A) 


/ 6 rv' assoCg(/) acv" assocs(A) ’ 

From inequality it follows that optimal thresholding of 
f* yields a set that is a minimizer of problem 0 - D 

Corollary 1. The team formation problem \GDSP ) is 
equivalent to the problem fd) if 7 is chosen according to the 
condition given in Theorem 71 

Proof. This directly follows from Theorems^ and^ □ 


While the continuous problem is as hard as the original 
discrete problem, recent ideas from continuous optimization 
[14] allow us to derive in the next section an algorithm for 
obtaining locally optimal solutions very efficiently. 


yields a set A* that is optimal for problem HP- 

Proof. Let R(A) = vol 9 (A) + izsunit(A) + 7pen(A). 
Then we have 

R(A) . R l (1a) ^ . R L (f) 

mm - =-= = mm - =- -- > mm -=-— 

acv’ assocs(A) a<zv' assocgfln) /<= rv' assoc g(f) 

where in the first step we used the fact that R L (f) and 
assoc L (f) are extensions of R{A) and assoc(A), respectively. 
Below we first show that the above inequality also holds in 
the other direction, which then establishes that the optimum 
values of both problems are the same. The proof of the re¬ 
verse direction will also imply that a set minimizer of the 
problem 0 can be obtained from any minimizer f* of 0 
via optimal thresholding. 


4.3 Algorithm for the Continuous Problem 

We now describe an algorithm for (approximately) solving 
the continuous optimization problem <@. The idea is to 
make use of the fact that the fractional optimization problem 
0 has a special structure: as we will show in this section, 
it can be written as a special ratio of difference of convex 
(d.c.) functions, i.e. it has the form 


mi , ^j s 2 3 H) : =W)’ 

fexY 01 (/) - b 2 {f) 


( 8 ) 


where the functions Ri , R 2 , Si and S 2 are positively one- 
homogeneous convex function 0 and numerator and denom¬ 
inator are nonnegative. This reformulation then allows us 
to use a recent first order method called RatioDCA [l4] |7] . 

In order to find the explicit form of the convex func¬ 
tions, we first need to rewrite the penalty term as pen(A) = 


We first show that the optimal thresholding of any f G R+ 
yields a set A such that 1 a has an objective value at least as 


2 A function / is said to be positively one-homogeneous if 

f(ax) = af(x), a > 0 . 






















pen x (A) — pen 2 (A), where 

peni(A) = 52* =1 vol Mj (A) + Y7j=i k i unit( J 4), 
pen 2 (A) = X]j = imm{/ J ,volM J (^)}+E^i mill { fc i. v olM J (A)} 
“ Eu, w gA max ( 0 > dist(u, v) - d 0 }. 

Using this decomposition of pen(A), we can now write down 
the functions Ri,R 2 , S 1 and S 2 as 

Ri (/) = volp (/) + crmax}/;} 

X 

Ri{f) = 7Pen 2 (/) 

S '1 (/) = vol^ (/) + vole's (/) + fJs max}/;} 

l 

S 2 (f) = cut L (f). 


The reformulation of the problem in the form © en¬ 
ables us to apply a modification of the recently proposed 
RatioDCA [14| f], a method for the local minimization of 
objectives of the form (|8| on the whole R m . Given an 

RatioDCA [14 Minimization of a non-negative ratio of 
one-homogeneous d.c functions over R+ 

1: Initialization: f° £ R+, A 0 = Q(f°) 

2: repeat 

3: f l+1 = a.rgminRi(u)+X l S 2 (u)-(u, r 2 (f l ) + X l si(f 1 )) 

iieny*, ||«|| 2 <i 

where r 2 (f l ) £ dR 2 (f l ), si(f) £ dSi(f l ) 

4: X l+1 = Q(f l+1 ) 

5: until 1 -- L < e 


where p := g + 7^ Mj , cr := v s +7Ej=i k B Pen^ (/) 
denotes the Lovasz extension of pen 2 (A), and 

void}/) = {(hi)Z=i,f) ,where h £ R 71 , 
cut L {f) = | £™=i Wi i I fi- fA- 


Lemma 1. Using the functions Ri, R. 2 , Si and S 2 defined 
above, the problem can be rewritten in the form |8|. The 
functions Ri,R 2 ,Si and S 2 are convex and positively one- 
homogeneous, and Ri — R 2 and Si — S 2 are nonnegative. 

Proof. The denominator of © is given as assoc s(f) = 
vol^/) - cut L (f) + void's}/) + ps unit i (/), and the nu¬ 
merator is given as volg (/) + its unit 1, (A) + 7 pen L (/). Us¬ 
ing Prop.2.1 in 3 and the decomposition of pen(A) intro¬ 
duced earlier in this section, we can decompose pen L (/) = 
pen}"}/) — pen 2 (/). The Lovasz extension of penj(A) is 
given as penf(/) = £? =1 vol^. (/) + ££1 k i max;}/;}, and 
let pen 2 (/) denote the Lovasz extension of pen 2 (A) (an ex¬ 
plicit form is not necessary, as shown later in this section). 
The equality between © and (|8jl then follows by simple 
rearranging of the terms. 

The nonnegativity of the functions Ri — R 2 and Si — S 2 
follows from the nonnegativity of denominator and numera¬ 
tor of ©. and the definition of the Lovasz extension. More¬ 
over, the Lovasz extensions of any set function is positively 
one-homogeneous |3 . 

Finally, the convexity of Ri and Si follows as they are a 
non-negative combination of the convex functions max;}/;} 
and ((hi)iLi, f) for some h £ R". The function S 2 (f) = 
cut L (f) is well-known to be convex [ 3 ]. To show the con¬ 
vexity of R 2 , we will show that the function pen 2 (A) is sub- 
modulai]^] The convexity then follows from the fact that a 
set function is submodular if and only if its Lovasz extension 
is convex ( 3 ]. For the proof of the submodularity of the first 
two sums one uses the fact that the pointwise minimum of a 
constant and a increasing submodular function is again sub- 
modular. Writing D uv := max}0, dist(u, v) — do}, the last 
sum can be written as - J2 u ,veA = ~ J2u£A,veV + 
A,vev'\A D uv- Using (d D ); = Aj, we can write 
its Lovasz extension as - vold D (/)+ § J2i,jev I/* “ fo I > 

which is a sum of a linear term and a convex term. □ 


3 A set function R : 2 V —> R is submodular if for all A, B C 
V, R{A U B) + R(A n B)< R(A) + R(B). 


initialization fo, the above algorithm solves a sequence of 
convex optimization problems (line 3). Note that we do not 
need an explicit description of the terms Si(f) and R 2 (f), 
but only elements of their sudifferential Si(/) £ dSi(f) resp. 
r 2 (f) £ dR 2 (f). The explicit forms of the subgradients are 
given in the appendix. The convex problem (line 3) then 
has the form 


min 
/ ER™ 


\l m 

■J Wi i I £ 

i,j =1 


fo I + (f,c)+c r max}/;}, 
% 


(9) 


where c = p—r 2 (f l )—X l si(f l ). Note that © is a non-smooth 
problem. However, there exists an equivalent smooth dual 
problem, which we give below. 

Lemma 2. The problem © is equivalent to 


1 

min min — 

01 II 00 - 1 *'6 S m 2 





2 

5 

2 


where A : R B R v with ( Aa)i := £L w ij{ a ij ~ a ji)> Pit™ 
denotes the projection on the positive orthant and S m is the 
simplex S m = }u £ R m | w; > 0, 1 «; = !}• 

Proof. First we use the homogenity of the objective in 
the inner problem to eliminate the norm constraint. This 
yields the equivalent problem 


min crmaxit; + — Hull? + (u, c) + 

ugMJ i 2 


x l 71 

— w tt\ Ui -Uj |. 
*0 = 1 


We derive the dual problem as follows: 


X 1 1 

min — wn \ui — uA + amaxu; H— llull? + (u, c) 

uei" 2 2 2 

+ * 0=1 

r x l 71 

= min } max — > wa (ui — ui) an 
ueif 1 i|ct||_<i 2 ^ 


OLA4 = — Ot o' 


* 0=1 


+ max a (u, v) H— 
»6S„ 2 



1 I, 11 2 

= max mm - u L + 
ll“lloo<i **eK+ 2 2 

ves n 


u, c + — Aa + ov 


where (Aa); := E? J u ’*t( a *j — a t*)- The optimization over 
u has the solution u = Pr" (—c — \Aa — av). Plugging u 
into the objective and using that (Pr** (x),x) = ||Pr™ (x) |||, 
we obtain the result. □ 













The smooth dual problem can be solved very efficiently us¬ 
ing recent scalable first order methods like FISTA 6 ], which 
has a guaranteed convergence rate of O(jft), where k is the 
number of steps done in FISTA. The main part in the calcu¬ 
lation of FISTA consists of a matrix-vector multiplication. 
As the social network is typically sparse, this operation costs 
0 (m), where m is the number of non-zeros of W. 

RatioDCA L4], produces a strictly decreasing sequence 
f l , i.e., Q(f l+L ) < Q(f l ), or terminates. This is a typical 
property of fast local methods in non-convex optimization. 
Moreover, the convex problem need not be solved to full 
accuracy; we can terminate the convex problem early, if the 
current f l produces already sufficent descent in Q. As the 
number of required steps in the RatioDCA typically ranges 
between 5-20, the full method scales to large networks. Note 
that convergence to the global optimum of | 8 | cannot be 
guaranteed due to the non-convex nature of the problem. 
However, we have the following quality guarantee for the 
team formation problem. 


Theorem 3. Let Ap be a feasible set for the problem ( [GDSP| ) 
and 7 is chosen as in Theorem Q] Let f* denote the result 
of RatioDCA after initializing with the vector 1a 0; and let 
Af » denote the set found by optimal thresholding of f*. Ei¬ 
ther RatioDCA terminates after one iteration, or produces 
Af * which satisfies all the constraints of the team formation 
problem .GDSP) and 


assocs(A/*) assocs(Ao) 
vol g (A f *) + u s vol 9 (A 0 ) + us 


Proof. RatioDCA generates a decreasing sequ ence {f 1 } 
such that Q(f I+1 ) < Q{f l ) until it terminates \lfj- We have 
Q(/ 1 ) < < 3 ( 1 ^ 0 ), if the algorithm does not stop in one step. 
As shown in Theorem ©) optimal thresholding of f 1 yields a 
set Af that achieves smaller objective on the corresponding 
set function. Since the chosen value of 7 guarantees the 
satisfaction of the constraints, Af has to be feasible. □ 


5. LP RELAXATION OF GDSP 

Recall that our team formation problem based on the den¬ 
sity objective is rewritten as the following GDSP after inte¬ 
grating the subset constraint: 


assocs(A) 
max — , . .. —— 
a cv' volg(A) + us 


( 10 ) 


subject to : kj ft voIm., (A) ft lj, Vj € {1,... ,p} 
dist(u, v) ft do, Mu,v £ A 


Note that here we do not require the additional constraint, 
A 7 ^ 0, that we added to ( |GDSP[ ). In this section we show 
that there exists a Linear programming (LP) relaxation for 
this problem. The LP relaxation can be solved optimally in 
polynomial time and provides an upper bound on the op¬ 
timum value of GDSP. In practice such an upper bound is 
useful to check the quality of the solutions found by approx¬ 
imation algorithms. 


Theorem 4. The following LP is a relaxation of the Gen¬ 
eralized Densest Subgraph Problem (10). 


max 

ten, fe»y', aem E ' 

subject to : tkj ft (Mj, f) ft tlj, Mj £ {1,... ,p} 
fu + fv < t, Mu, v : dist(u, v) > d 0 
t> 0, an ft ft, otij ft fj, M(i,j) £ E' 

0 < ft ft t, Vi £ V', aij > 0, V(i, j) £ E' 
{g,f)+tu s = 1. 

where V' = V\S, E' is the set of edges induced by V'. 


Y, UHjaij + 2 (d s , f) + tp,s ( 11 ) 

i,j =1 


Proof. The following problem is equivalent to be¬ 

cause (i) for every feasible set A of (10 1 , there exist corre¬ 
sponding feasible y, X given by y = 1a, Xij = min{yi,yj}, 
with the same objective value and (ii) an optimal solution of 
the following problem always satisfies X*j = min{y*, y*}. 


2 Ei<,- VHjXij + 2 ( d s ,y) + p s 

max - - - - --- 

s£{0, l} 1 ", XS{0, i} E ' {g, y) + vs 

subject to : kj < (Mj,y) < lj, Mj £ {1,... ,p} 
Vu+Vv < 1, Mu, v : dist(u, v) > d 0 
X-ij ft Vi, Xij ft y, , M(i, j) £ E 


Relaxing the integrality constraints and using the substi¬ 
tution, Xij = an d y i = we obtain the relaxation: 

2 Ei<,- WijOtij +2 (d s ,f)+tp s 

max --- - -—- 

t6R, fem v ', o 6* e/ {g, f) + tvs 

subject to : tkj ft. {Mj, f) ft tlj, Mj £ {1,... ,p} 

fu + fv ft t, Mu, v : dist(u, v) > do 

t > 0, ft ft, ft ft, M(i,j) £ E 1 

0 ft ft ft t, Mi£ V', a^ > 0, M(i,j) £ E 1 


Since this problem is invariant under scaling, we can fix 
the scale by setting the denominator to 1 , which yields the 
equivalent LP stated in the theorem. □ 


Note that the solution f* of the LP (111 is, in general, 
not integral, i.e., /* {0,1}' . One can use standard tech¬ 

niques of randomized rounding or optimal thresholding to 
derive an integral solution from f*. However, the resulting 
integral solution may not necessarily give a subset that sat¬ 
isfies the constraints of (10 1 . In the special case when there 
are only lower bound constraints, i.e., problem © , one can 
obtain a feasible set A for problem ( ]3| by thresholding f* 
(see <©) according to the objective of ( |10[ ). This is possible 
in this special case because there is always a threshold f* 
which yields a non-empty subset Ai (in the worst case the 
full set V') satisfying all the lower bound constraints. In our 
experiments on problem j3j, we derived a feasible set from 
the solution of LP in this fashion by choosing the threshold 
that yields a subset that satisfies the constraints and has the 
highest objective value. 

Note that the LP relaxation is vacuous with respect 
to upper bound constraints in the sense that given / £ R m 
that does not satisfy the upper bound constraints of the LP 


(111 one can construct /, feasible for the LP by rescaling / 
without changing the objective of the LP. This implies that 

















one can always transform the solution of the unconstrained 
problem into a feasible solution when there are only upper 
bound constraints. However, in the presence of lower bound 
or subset constraints, such a rescaling does not yield a fea¬ 
sible solution and hence the LP relaxation is useful on the 
instances of ( | 10 | ) with at least one lower bound or a subset 
constraint (i.e., vs > 0 ). 

6. EXPERIMENTS 

We now empirically show that FORTE consistently pro¬ 
duces high quality compact teams. We also show that the 
quality guarantee given by Theorem [3] is useful in practice 
as our method often improves a given sub-optimal solution. 

6.1 Experimental Setup 

Since we are not aware of any publicly available real world 
datasets for the team formation problem, we use, as in 12 ], 
a scientific collaboration network extracted from the DBLP 
database. Similar to 12], we restrict ourselves to four fields 
of computer science: Databases (DB), Theory (T), Data 
Mining (DM), Artificial Intelligence (AI). Conferences that 
we consider for each field are given as follows: DB = (SIG- 
MOD, VLDB, ICDE, ICDT, PODS}, T = {SODA, FOCS, STOC, 
STACS, ICALP, ESA}, DM = {WWW, KDD, SDM, PKDD, 
ICDM, WSDM}, AI = {IJCAI, NIPS, ICML, COLT, UAI, CVPR} 

For our team formation problem, the skill set is given by 
A ={DB, T, DM, AI}. Any author who has at least three 
publications in any of the above 23 conferences is considered 
to be an expert. In our DBLP co-author graph, a vertex 
corresponds to an expert and an edge between two experts 
indicates prior collaboration between them. The weight of 
the edge is the number of shared publications. Since the 
resulting co-author graph is disconnected, we take its largest 
connected component (of size 9264) for our experiments. 

Directly solving the non-convex problem © for the value 
of 7 given in Theorem |T] often yields poor results. Hence 
in our implementation of FORTE we adopt the following 
strategy. We first solve the unconstrained version of problem 
© (i.e., 7 = 0 ) and then iteratively solve © for increasing 
values of 7 until all constraints are satisfied. I 11 each iter¬ 
ation, we increase 7 only for those constraints which were 
infeasible in the previous iteration; in this way, each penalty 
term is regulated by different value of 7 . Moreover, the so¬ 
lution obtained in the previous iteration of 7 is used as the 
starting point for the current iteration. 

6.2 Quantitative Evaluation 

In this section we perform a quantitative evaluation of 
our method in the special case of the team formation prob¬ 
lem with lower bound constraints and gi = lVi (problem 
©)■ We evaluate the performance of our method against 
the greedy method proposed in 12 / refered to as mdAlk. 
Similar to the experiments of |l2| , an expert is defined to 
have a skill level of 1 in skill j, if he/she has a publication in 
any of the conferences corresponding to the skill j. As done 
in 12 ], we create random tasks for different values of skill 
size, k = {3, 8,13,18, 23, 28}. For each value of k we sample 
k skills with replacement from the skill set A = {DB, T, DM, 
AI}. For example if k = 3, a sample might contain {DB, DB, 
T}, which means that the random task requires at least two 
experts from the skill DB and one expert from the skill T. 

In Figure 1, we show for each method the densities, sizes 
and runtimes for the different skill sizes k, averaged over 10 


random runs. In the first plot, we also show the optimal 


values of the LP relaxation in ( 111 . Note that this provides 
an upper bound on the optimal value of ( |GDSP[ ). We can 
obtain feasible solutions from the LP relaxation of ( |GDSP[ ) 
via thresholding (see Section [5|, which are shown in the 
plot as LPfeas. Furthermore, the plots contain the results 
obtained when the solutions of LPfeas and mdAlk are used 
as the initializations for FORTE (in each of the 7 iteration). 

The plots show that FORTE always produces teams of 
higher densities and smaller sizes compared to mdAlk and 
LPfeas. Furthermore, LPfeas produces better results than 
the greedy method in several cases in terms of densities and 
sizes of the obtained teams. The results of mdAlk+FORTE 
and LPfeas+FORTE further show that our method is able 
improve the sub-optimal solutions of mdAlk and LPfeas 
significantly and achieves almost similar results as that of 
FORTE which was started with the unconstrained solu¬ 
tion of ©. Under the worst-case assumption that the up¬ 
per bound on (GDSP I computed using the LP is the optimal 


value, the solution of FORTE is 94% — 99% optimal (de¬ 
pending on k). 

6.3 Qualitative Evaluation 

In this experiment, we assess the quality of the teams 
obtained for several tasks with different skill requirements. 
Here we consider the team formation problem ( |GDSP[ ) in 
its more general setting. We use the generalized density ob¬ 
jective of 0 where each vertex is given a rank 17 , which 
we define based on the number of publications of the cor¬ 
responding expert. For each skill, we rank the experts ac¬ 
cording to the number of his/her publications in the con¬ 
ferences corresponding to the skill. In this way each expert 
gets four different rankings; the total rank of an expert is 
then the minimum of these four ranks. The main advantage 
of such a ranking is that the experts that have higher skill 
are given preference, thus producing more competent teams. 
Note that we choose a relative measure like rank as the ver¬ 
tex weights instead of an absolute quantity like number of 
publications, since the distribution of the number of pub¬ 
lications varies between different fields. In practice such a 
ranking is always available and hence, in our opinion, should 
be incorporated. 

Furthermore, in order to identify the main area of exper¬ 
tise of each expert, we consider his/her relative number of 
publications. Each expert is defined to have a skill level of 
1 in skill j if he has more than 25% of his/her publications 
in the conferences corresponding to skill j. As a distance 
function between authors, we use the shortest path on the 
unweighted version of the DBLP graph, i.e. two experts are 
at a distance of two, if the shortest path between the corre¬ 
sponding vertices in the unweighted DBLP graph contains 
two edges. Note that in general the distance function can 
come from other general sources beyond the input graph, 
but here we had to rely on the graph distance because of 
lack of other information. 

In order to assess the competence of the found teams, we 
use the list of the 10000 most cited authors of Citeseer 11 . 
Note that in contrast to the skill-based ranking discussed 
above, this list is only used in the evaluation and not in the 
construction of the graph. We compute the average inverse 
rank as in 12 as AIR := 1000 • JA =1 ~k~ ■ where k is the size 
of the team and Ri is the rank of expert i on the Citeseer 
list of 10000 most cited authors. For authors not contained 












Figure 1: Densities, team sizes and runtimes of mdAlk, our method (FORTE), a feasible point constructed 
from the LP (LPfeas), and FORTE initialized with LPfeas and mdAlk, averaged over 10 trials. All versions 
of (FORTE) significantly outperform mdAlk, and LPfeas both in terms of densities and sizes of the teams 
found. The densities of FORTE are close to the upper bound on the optimum of the GDSP given by the LP. 


on the list we set Ri — 10001. We also report the densities 
of the teams found in order to assess their compatibility. 

We create several tasks with various constraints and com¬ 
pare the teams produced by FORTE, mdAlk and LPfeas 
(feasible solution derived from the LP relaxation). Note that 
in our implementation we extended the mdAlk algorithm 
of 12] to incorporate general vertex weights, using Dinkel- 
bach’s method from fractional programming 11 . The re¬ 
sults for these tasks are shown in Table 1. We report the 
upper bound given by the LP relaxation, density value, AIR 
as well as number and sizes of the connected components. 
Furthermore, we give the names and the Citeseer ranks of 
the team members who have rank at most 1000. Note that 
mdAlk could only be applied to some of the tasks and 
LPfeas failed to find a feasible team in several cases. 

As a first task we show the unconstrained solution where 
we maximize density without any constraints. Note that 
this problem is optimally solvable in polynomial time and all 
methods find the optimal solution. The second task asks for 
at least three experts with skill DB. Here again all methods 
return the same team, which is indeed optimal since the LP 
bound agrees with the density of the obtained team. 

Next we illustrate the usefulness of the additional model¬ 
ing freedom of our formulation by giving an example task 
where obtaining meaningful, connected teams is not possi¬ 
ble with the lower bound constraints alone. Consider a task 
where we need at least four experts having skill AI (Task 
3). For this, all methods return the same disconnected team 
of size seven where only four members have the skill AI. 
The other three experts possess skills DB and DM and are 
densely connected among themselves. One can see from the 
LP bound that this team is again optimal. This example 
illustrates the major drawback of the density based objec¬ 
tive which while preferring higher density subgraphs com¬ 
promises on the connectivity of the solution. Our further 


experiments revealed that the subgraph corresponding to the 
skill AI is less densely connected (relative to the other skills) 
and forming coherent teams in this case is difficult without 
specifying additional requirements. With the help of subset 
and distance based constraints supported by FORTE, we 
can now impose the team requirements more precisely and 
obtain meaningful teams. In Task 4, we require that An¬ 
drew Y. Ng is the team leader and that all experts of the 
team should be within a distance of two from each other in 
terms of the underlying co-author graph. The result of our 
method is a densely connected and highly ranked team of 
size four with a density of 3.89. Note that this is very close 
to the LP bound of 3.91. The feasible solution obtained by 
LPfeas is worse than our result both in terms of density 
and AIR. The greedy method mdAlk cannot be applied 
to this task because of the distance constraint. In Task 5 
we choose Bernhard Schoelkopf as the team leader while 
keeping the constraints from the previous task. Out of the 
three methods, only FORTE can solve this problem. It pro¬ 
duces a large disconnected team, many members of which 
are highly skilled experts from the skill DM and have strong 
connections among themselves. To filter these densely con¬ 
nected members of high expertise, we introduce a budget 
constraint in Task 6, where we define the cost of the team 
as the total number of publications of its members. Again 
this task can be solved only by FORTE which produces 
a compact team of four well-known AI experts. A slightly 
better solution is obtained when FORTE is initialized with 
the infeasible solution of the LP relaxation as shown (only 
in this task). This is an indication that on more difficult 
instances of (GDSP I, it pays off to run FORTE with more 
than one starting point to get the best results. The solution 
of the LP, possibly infeasible, is a good starting point apart 
from the unconstrained solution of ©>• 



















Task 

FORTE 

mdAlk 

LPfeas 

Task 1: Uncon¬ 
strained 

(LP bound: 32.7) 

#Comps: 1 (2) Density: 32.7 
AIR: 11.1 

Jiawei Han (54), Philip S. Yu 
(279) 

#Comps: 1 (2) Density: 32.7 
AIR: 11.1 

Jiawei Han (54), Philip S. Yu 
(279) 

#Comps: 1 (2) Density: 32.7 
AIR: 11.1 

Jiawei Han (54), Philip S. Yu 
(279) 

Task 2: DB>3 
(LP bound: 29.8) 

#Comps: 1 (3) Density: 29.8 
AIR: 7.56 

Jiawei Han (54), Philip S. Yu 
(279) (+1) 

#Comps: 1 (3) Density: 29.8 
AIR: 7.56 

Jiawei Han (54), Philip S. Yu 
(279) (+1) 

#Comps: 1 (3) Density: 29.8 
AIR: 7.56 

Jiawei Han (54), Philip S. Yu 
(279) (+1) 

Task 3: AI>4 
(LP bound: 16.6) 

#Comps: 3 (1,3,3) Density: 
16.6 AIR: 10.3 

Michael I. Jordan (28), Jiawei 
Han (54), Daphne Roller (127), 
Philip S. Yu (279), Andrew Y. 
Ng (345), Bernhard Schoelkopf 
(364) (+1) 

#Comps: 3 (1,3,3) Density: 
16.6 AIR: 10.3 

Michael I. Jordan (28), Jiawei 
Han (54), Daphne Roller (127), 
Philip S. Yu (279), Andrew Y. 
Ng (345), Bernhard Schoelkopf 
(364) (+1) 

#Comps: 3 (1,3,3) Density: 
16.6 AIR: 10.3 

Michael I. Jordan (28), Jiawei 
Han (54), Daphne Roller (127), 
Philip S. Yu (279), Andrew Y. 
Ng (345), Bernhard Schoelkopf 
(364) (+1) 

Task 4: AI>4, 

distG(u, v) <2, 
S={Andrew Ng} 
(LP bound: 3.91) 

#Comps: 1 (4) Density: 3.89 AIR: 14.2 

Michael I. Jordan (28), Sebastian Thrun (97), Daphne Roller 
(127), Andrew Y. Ng (345) 

#Comps: 1 (6) Density: 3.5 
AIR: 12.5 

Michael I. Jordan (28), Ge¬ 
offrey E. Hinton (61), Sebas¬ 
tian Thrun (97), Daphne Roller 
(127), Andrew Y. Ng (345), 
Zoubin Ghahramani (577) 

Task 5: AI>4, 

distG(w, v) <2, 
S={B.Schoelkopf} 
(LP bound: 6.11) 

#Comps: 2 (11,1) Density: 3.54 AIR: 3.94 

Jiawei Han (54), Christos Faloutsos (140), Thomas S. Huang (146), Philip S. Yu (279), Zheng 

Chen (308), Bernhard Schoelkopf (364), Wei-Ying Ma (523), Ke Wang (580) (+4) 

Task 6: AI>4, 

distG(«, v) <2, 
S={B.Schoelkopf}, 
Ei Ci <255 
(LP bound: 2.06) 

T^Comps: 1 (4) Density: 1.24 AIR: 1.82 

Alex J. Smola (335), Bernhard Schoelkopf (364) (+2) 

LP+FORTE: #Comps: 2 (2,2) Density: 1.77 AIR: 2.73 

Robert E. Schapire (293), Alex J. Smola (335), Bernhard Schoelkopf (364), Yoram Singer (568) 

Task 7: 3<DB<6, 
DM>10, 

(LP bound: 11.3) 

#Gomps: 1 (10) Density: 9.52 AIR: 4.96 

Haixun Wang (50), Jiawei Han (54), Philip S. Yu (279), Zheng Chen (308), Re Wang (580) 

(+5) 

Task 8: 2<DB<5, 

10<DM<15, 

5<AI<10 

(LP bound: 10.7) 

T^Comps: 3 (1,12,3) Density: 7.4 AIR: 5.06 

Michael I. Jordan (28), Jiawei Han (54), Daphne Roller (127), Philip S. Yu (279), Zheng Chen 
(308), Andrew Y. Ng (345), Bernhard Schoelkopf (364), Wei-Ying Ma (523), Divyakant Agrawal 
(591) (+7) 

Task 9: AI<2, 

T>2, |C|<6 
(LP bound: 19) 

#Comps: 3 (2,2,2) Density: 6.17 AIR: 1.53 

Didier Dubois (426), Micha Sharir (447), Divyakant Agrawal (591), Henri Prade (713), Pankaj 

R. Agarwal (770) (+1) 


Table 1: Teams formed by FORTE, mdAlk and LPfeas for various tasks. We list the number and sizes of 
the found components, the (generalized) maximum density as well as the average inverse rank (AIR) based 
on the Citeseer list. Finally, we give name and rank of each team member with rank at most 1000. Experts 
who do not have the skill required by the task but are still included in the team are shown in italic font. 


Tasks 7, 8 and 9 provide some additional teams found by 
FORTE for other tasks involving upper and lower bound 
constraints on different skills. As noted in Section[5]the LP 
bound is loose in the presence of upper bound constraints 
and this is also the reason why it was not possible to derive 
a feasible solution from the LP relaxation in these cases. In 
fact the LP bounds for these tasks remain the same even if 
the upper bound constraints are dropped from these tasks. 

7. CONCLUSIONS 

By incorporating various realistic constraints we have made 
a step forward towards a realistic formulation of the team 
formation problem. Our method finds qualitatively better 
teams that are more compact and have higher densities than 


those found by the greedy method 12j. Our linear program¬ 
ming relaxation not only allows us to check the solution 
quality but also provides a good starting point for our non- 
convex method. However, arguably, a potential downside 
of a density-based approach is that it does not guarantee 
connected components. A further extension of our approach 
could aim at incorporating “connectedness” or a relaxed ver¬ 
sion of it as an additional constraint. 
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APPENDIX 

The subgradient of S 1 (/) is given by Si (/) = d + d s + 
Hslmaxif), where I ma x(f) is the indicator function of the 
largest entry of /. For the subgradient of R 2 , using Prop. 
2.2. in [ 3 ], we obtain for the subgradient of the terms 

of the form min{(/, voImj (A)}, 

'0 vol Mj (A i+1 ) > lj 

(. ( f \\ _) h - v °1mj {A i+ i) vol Mj {Ai)>lj, 

(/))-< vol 4(A i+1 )<Z, ■ 

Mij vol Mj {Ai)<lj 

Defining D uv := max{0, dist(w, v) — do}, an element of the 
subgradient of the second term of R 2 is given as do — p(f), 

where ( d D )i = and p(f)i G {j2jLi D ij u ij \ u ij = 

-Uji,Uij G sign (fi - fj) j, where sgn(a;) := +1, if x > 0; 
-1 if x < 0; [—1,1], if x = 0. In total, we obtain for the 
subgradient r 2 (f) of f? 2 (/), 
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